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MACK and GISH PATENT 
Application No.: 09/829,472 
Page 2 

In the Specification: 

Please replace the paragraph (Table 1) beginning at page 94, line 1, with the following 
(see attached pages 94-103): 

Please insert the accompanying paper copy of the Sequence Listing, page numbers 1 to 
32, at the end of the application. 

REMARKS 

Applicants request entry of this amendment in adherence with 37 C.F.R. 
§§1.821 to 1.825. This amendment is accompanied by a floppy disk containing the above 
named sequences, SEQ ID NOS:1-20, in computer readable form, and a paper copy of 
the sequence information which has been printed from the floppy disk. 

The information contained in the computer readable disk was prepared 
through the use of the software program "Patentln" and is identical to that of the paper 
copy. This amendment contains no new matter. 

Attached hereto is a marked-up version of the changes made to the 
Specification and Abstract by the current Amendment. The attached pages are captioned 
"VERSION WITH MARKINGS TO SHOW CHANGES MADE." 
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If the Examiner believes a telephone conference would expedite 
prosecution of this application, please telephone the undersigned at 415-576-0200. 
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TOWNSEND and TOWNSEND and CREW LLP 

Two Embarcadero Center, 8 th Floor 

San Francisco, California 94111-3834 

Tel: (415) 576-0200 

Fax: (415) 576-0300 
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VERSION WITH MARKINGS TO SHOW CHANGES MADE 
In the Specification: 

Paragraph (Table 1) beginning at line 1 of page 94 has been amended as follows (see 
attached pages 94-103): 
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Table 1 

BCA4 DNA sequence (SEQ ID N0:1) 

Gene name: osteoblast specific factor 2 (periostin) ; Unigene number: Hs. 136348; Probeset 

Accession #: D13666; Nucleic Acid Accession #: NM_006475; Coding sequence : 12-2522 (start 
and stop codons underlined) 

AGAGACTCAA G ATGA TTCCC TTTTTACCCA TGTTTTCTCT ACTATTGCTG CTTATTGTTA 6 0 

ACCCTATAAA CGCCAACAAT CATTATGACA AGATCTTGGC TCATAGTCGT ATCAGGGGTC 12 0 

GGGACCAAGG CCCAAATGTC TGTGCCCTTC AACAGATTTT GGGCACCAAA AAGAAATACT 180 

TCAGCACTTG TAAGAACTGG TATAAAAAGT CCATCTGTGG ACAGAAAACG ACTGTTTTAT 24 0 

ATGAATGTTG CCCTGGTTAT ATGAGAATGG AAGGAATGAA AGGCTGCCCA GCAGTTTTGC 300 

CCATTGACCA TGTTTATGGC ACTCTGGGCA TCGTGGGAGC CACCACAACG CAGCGCTATT 360 

CTGACGCCTC AAAACTGAGG GAGG AG AT CG AGGGAAAGGG ATCCTTCACT TACTTTGCAC 42 0 

CGAGTAATGA GGCTTGGGAC AACTTGGATT CTGATATCCG TAGAGGTTTG GAGAGCAACG 480 

TGAATGTTGA ATTACTGAAT GCTTTACATA GTCACATGAT TAATAAGAGA ATGTTGACCA 54 0 

AGGACTTAAA AAATGGCATG ATTATTCCTT CAATGTATAA CAATTTGGGG CTTTTCATTA 600 

ACCATTATCC TAATGGGGTT GTCACTGTTA ATTGTGCTCG AATC ATC CAT GGGAACCAGA 66 0 

TTGCAACAAA TGGTGTTGT C CATGTCATTG ACCGTGTGCT TACACAAATT GGTACCTCAA 72 0 

TTCAAGACTT CATTGAAGCA GAAGATGACC TTTCATCTTT TAGAGCAGCT GCCATCACAT 78 0 

CGGACATATT GGAGGCCCTT GGAAGAGACG GTCACTTCAC ACTCTTTGCT CCCACCAATG 84 0 

AGGCTTTTGA GAAACTTCCA CGAGGTGTCC TAGAAAGGTT CATGGGAGAC AAAGTGGCTT 900 

CCGAAGCTCT TATGAAGTAC CACATCTTAA ATACTCTCCA GTGTTCTGAG TCTATTATGG 960 

GAGGAGCAGT CTTTGAGACG CTGGAAGGAA ATACAATTGA GATAGGATGT GACGGTGACA 102 0 

GTATAACAGT AAATGGAATC AAAATGGTGA ACAAAAAGGA TATTGTGACA AATAATGGTG 108 0 

TGATCCATTT G ATTG AT C AG GTCCTAATTC CTGATTCTGC CAAACAAGTT ATTGAGCTGG 114 0 

CTGGAAAACA GCAAACCACC TTCACGGATC TTGTGGCCCA ATTAGGCTTG GCATCTGCTC 120 0 

TGAGGC CAGA TGGAGAATAC ACTTTGCTGG CACCTGTGAA TAATGCATTT T CTGATGAT A 126 0 

CTCTCAGCAT GGTTCAGCGC CTCCTTAAAT TAATTCTGCA GAATCACATA TTGAAAGTAA 132 0 

AAGTTGGCCT TAATGAGCTT TACAACGGGC AAATACTGGA AACCAT CGGA GGCAAACAGC 13 80 

TCAGAGTCTT CGTATATCGT ACAGCTGTCT GCATTGAAAA TTCATGCATG GAGAAAGGGA 1440 

GTAAGCAAGG GAGAAACGGT GCGATTCACA TATTCCGCGA GATCATCAAG CCAGCAGAGA 1500 

AATCCCTCCA TGAAAAGTTA AAACAAGATA AGCGCTTTAG CACCTTCCTC AGCCTACTTG 1560 

AAGCTGCAGA CTTGAAAGAG CTCCTGACAC AACCTGGAGA CTGGACATTA TTTGTGCCAA 162 0 

CCAATGATGC TTTTAAGGGA ATGACTAGTG AAGAAAAAGA AATTCTGATA CGGGACAAAA 1680 

ATGCTCTTCA AAACATCATT CTTTAT CACC TGACACCAGG AGTTTTCATT GGAAAAGGAT 174 0 

TTGj^CCTGG TGTTACTAAC ATTTTAAAGA CCACACAAGG AAGCAAAATC TTTCTGAAAG 1800 

AAGTAAATGA TACACTTCTG GTGAATGAAT TGAAATCAAA AGAATCTGAC ATCATGACAA 186 0 

CAAATGGTGT AATTCATGTT GTAGATAAAC TCCTCTATCC AGCAGACACA CCTGTTGGAA 192 0 

ATGATCAACT GCTGGAAATA CTTAATAAAT TAATCAAATA CATCCAAATT AAGTTTGTTC 1980 

GTGGTAGCAC CTTCAAAGAA ATCCCCGTGA CTGTCTATAC AACTAAAATT ATAACCAAAG 2 04 0 

TTGTGGAACC AAAAATTAAA GTGATTGAAG GCAGTCTTCA GCCTATTATC AAAACTGAAG 2100 

GACCCACACT AACAAAAGTC AAAATTGAAG GTGAACCTGA ATTCAGACTG ATTAAAGAAG 2160 

GTGAAACAAT AACTGAAGTG ATC C ATGGAG AGCCAATTAT TAAAAAATAC ACCAAAATCA 222 0 

TTGATGGAGT GCCTGTGGAA ATAACTGAAA AAGAGACACG AGAAGAACGA AT CATTACAG 2280 

GTCCTGAAAT AAAATACACT AGGATTTCTA CTGGAGGTGG AGAAACAGAA GAAACTCTGA 2 34 0 

AGAAATTGTT ACAAGAAGAG GTCACCAAGG TCACCAAATT CATTGAAGGT GGTGATGGTC 2 400 

ATTTATTTGA AGATGAAGAA ATTAAAAGAC TGCTTCAGGG AGACACACCC GTGAGGAAGT 2460 

TGCAAGCCAA CAAAAAAGTT CAAGGTTCTA GAAGACGATT AAGGGAAGGT CGTTCTCAGT 2 520 

GAAAATCCAA AAACCAGAAA AAAATGTTTA TACAACCCTA AGTCAATAAC CTGACCTTAG 2 58 0 

AAAATTGTGA GAGCCAAGTT GACTTCAGGA ACTGAAACAT CAGCACAAAG AAGCAATCAT 264 0 

CAAATAATTC TGAACACAAA TTTAATATTT TTTTTTCTGA ATGAGAAACA TGAGGGAAAT 2 700 

TGTGGAGTTA GCCTCCTGTG GTAAAGGAAT TGAAGAAAAT ATAACACCTT ACACCCTTTT 2 760 

TCATCTTGAC ATTAAAAGTT CTGGCTAACT TTGGAATCCA TTAGAGAAAA ATCCTTGTCA 2 820 

CCAGATTCAT TACAATTCAA ATCGAAGAGT TGTGAACTGT TATCCCATTG AAAAG AC CG A 288 0 

GCCTTGTATG TATGTTATGG ATACATAAAA TGCACGCAAG CCATTATCTC T C CATGGGAA 2 94 0 

GCTAAGTTAT AAAAATAGGT GCTTGGTGTA CAAAACTTTT TATATCAAAA GGCTTTG C AC 3 000 

ATTTCTATAT GAGTGGGTTT ACTGGTAAAT TATGTTATTT TTTACAACTA ATTTTGTACT 3 060 

CTCAGAATGT TTGTCATATG CTTCTTGCAA TGCATATTTT TTAATCTCAA ACGTTTCAAT 3120 

AAAACCATTT TTCAGATATA AAGAGAATTA CTTCAAATTG AGTAATT CAG AAAAACTCAA 318 0 
GATTTAAGTT AAAAAGTGGT TTGGACTTGG GAA 

BCA4 Protein sequence (SEQ ID NO: 2) 

Gene name: osteoblast specific factor 2 (periostin); Unigene number: Hs. 136348; Probeset 
Accession #: D13666; Protein Accession #: NP_006466; Predicted Signal sequence: 1-21; TM 
domains: none; PFAM domains : f asciclin_domains : 94-232, 234-367, 496-630; Summary: a 

secreted protein involved in adhesion and osteoblast development; may participate in 
preferential metastasis of breast cancer to bone. 

MIPFLPMFSL LLLLIVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 6 0 

KNWYKKS I CG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS 12 0 

KLREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLE SNVNVE LLNALHSHMI NKRMLTKDLK 180 

NGMIIPSMYN NLGLFINHYP NGWTVNCAR IIHGNQIATN GWHVIDRVL TQIGTSIQDF 240 

IEAEDDLSSF RAAAITSDIL. EALGRDGH FT LFAPTNEAFE KLPRGVLERF MGDKVASEAL 300 

MKYHILNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360 

IDQVLIPDSA KQVIELiAGKQ QTT FTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420 
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VQRLLKLILQ 
RNGAIHIFRE 
FKGMTSEEKE 
TLLVNELKSK 
FKEIPVTVYT 
TEVIHGEPII 
QEEVTKVTKF 



NHILKVKVGL 
IIKPAEKSLH 
ILIRDKNALQ 
ESDIMTTNGV 
TKIITKWEP 
KKYTKIIDGV 
I EGGDGHLFE 



NELYNGQILE 
EKLKQDKRFS 
NIILYHLTPG 
IHWDKLLYP 
KIKVIEGSLQ 
PVEITEKETR 
DEEIKRLLQG 



TIGGKQLRVF 
TFLSLLEAAD 
VFIGKGFEPG 
ADTPVGNDQL 
PIIKTEGPTL 
EERIITGPEI 
DTPVRKLQAN 



VYRTAVCIEN 
LKELLTQPGD 
VTNILKTTQG 
LEILNKLIKY 
TKVKIEGEPE 
KYTRISTGGG 
KKVQGSRRRL 



SCMEKGSKQG 
WTLFVPTNDA 
SKI FLKEVND 
IQIKFVRGST 
FRLIKEGETI 
ETEETLKKLL 
REGRSQ 



BCA7 DNA sequence (SEQ ID NO: 3) 

Gene name: 5T4 oncofetal trophoblast glycoprotein; Unigene number: 
Accession #: Z29083; Nucleic Acid Accession #: NM_006670; Coding 
and stop codons underlined) 



CCGGCTCGCG 
AGCTCCGGGG 
GACGGGCGTC 
TCTCCCACCT 
TCCGCCCAGC 
CGCACAGT C A 
GTGCGCAACC 
CGCCGGCCGC 
GTGCGCGCGG 
CCACTGGCCG 
AGTCCCCTTG 
CGGAGCTTCG 
CGCCGCTTGG 
CTGCCCAGCC 
TCCTTCCGCA 
CTTCACAATG 
AACAATCCCT 
GAGGTAGTGC 
GTCCTCTTGG 
CAAACCTCTT 
GTTTTGTATT 
AGGGATCACA 
AACCTCAGTT 
CATGAGATGT 
TAGATACAAC 
TTTCTCGGTG 
TTCTTTTTCT 
TGGGCTTCTT 
ACAGATAGCA 
TATCAGTTTT 
CTGCAGACGT 
AG AGCATG CT 
TTCTTTGACA 
TTTTAATAAA 
ATTCTTAAAA 



CCCTCCGGGC 
AAACGCGAGC 
TGCGGCTGGC 
CCTCGGCATC 
CCCCGCTGCC 
AGTGCGTTAA 
TCTTCCTTAC 
CGCTGGCGGA 
GCGCCTTCGA 
ACCTCAGTCC 
TGGAACTGAT 
AGGGCATGGT 
AGCTGGCCAG 
TCAGGCACCT 
ACCTGACACA 
GCACCCTGGC 
GGGTCTGCGA 
AGGGCAAAGA 
AACTCAACAG 
ATGTCTTCCT 
TGAACCGCAA 
TGGAAGGGTA 
CTAACTCGGA 
AGACTTAAGC 
GGACTTTGAC 
TGTTCTGTTA 
TGGAACTCCT 
GCTGTCTGTC 
TTCAACAAAA 
ATTCTCATGT 
TAGCAGGCTC 
TACATTTTAC 
AAGTAAATTA 
CTGCATCGAG 
GAA 



CCAGCCTCCC 
CGCG ATG CCT 
GCGACTAGCG 
CTCCTTCTCC 
GGACCAGTGC 
CCGCAATCTG 
CGGCAACCAG 
GCTGGCCGCG 
GCATCTGCCC 
CTTCGCTTTC 
CCTGAACCAC 
GGTGGCGGCC 
CAACCACTTC 
GGACTTAAGT 
TCTAGAAAGC 
TGAGTTGCAA 
CTGCCACATG 
CCGGCTCACC 
TGCTGACCTG 
GGGTATTGTT 
GGGGATAAAA 
TCATTACAGA 
TGTC TGAG AA 
TTTATCCCTA 
TAAAAGCAGT 
ATGTAAGACG 
CAACACGTAT 
TCTCTCTCAG 
GCTGCCTCAA 
ACCTAAGTTG 
TTCAAAATAA 
TGTTCTGCAT 
CTTTTTTGAT 
ATCCAACCGA 



GAGCCTTCGG 
GGGGGGTGCT 
CTGGTACTCC 
TCCTCGGCGC 
CCCGCGCTGT 
ACCGAGGTGC 
CTGGCCGTGC 
CTCAACCTCA 
AGCCTGCGCC 
TCGGGCAGCA 
ATCGTGCCCC 
CTGCTGGCGG 
CTTTACCTGC 
AATAATTCGC 
CTCCACCTGG 
GGTCTACCCC 
G CAGACATGG 
TGTGCATATC 
GACTGTGACC 
TTAGCCCTGA 
AAGTGGATGC 
TATGAAATCA 
ATATTAGAGG 
CTAGGCTTGC 
GAAGGGGATT 
ATGAACAGTT 
GGAGGGATTT 
TACAGTTCAA 
CTTTTTCGAG 
TGGAGAAAAT 
CTCCATGGTG 
ATTACAAAAA 
TGCAGTTTAT 
CTGAATTGTT 



AGCGGGCGCC 
CCCGGGGCCC 
TGGGCTGGGT 
CGTTCCTGGC 
GCGAGTGCTC 
CCACGGACCT 
TCCCTGCCGG 
GCGGCAGCCG 
AGCTCGACCT 
ATGCCAGCGT 
CTGAAGATGA 
GCCGTGCACT 
CGCGGGATGT 
TGGTGAGCCT 
AGGACAATGC 
ACATTAGGGT 
TGACCTGGCT 
CGGAAAAAAT 
CGATTCTTCC 
TAGGCGCTAT 
AT AACAT CAG 
ATGCGGACCC 
ACAGACCAAG 
TCCACTTTCA 
TGCTTCCTTG 
GTGTATAGTG 
TTCAGGTTTC 
GGTGTAGCAA 
AAAAATACTT 
AATTGCATCC 
CACAGGAG C A 
ATAACTTGCA 
ATGAAAATGT 
AAAAAAAAAA 



GTCCCAGCCC 
CGCCGCCGGG 
CTCCTCGTCT 
TTCCGCCGTG 
CGAGGCAGCG 
GCCCGCCTAC 
CGCCTTCGCC 
CCTGGACGAG 
CAGCCACAAC 
CTCGGCCCCC 
GCGGCAGAAC 
GCAGGGGCTC 
GCTGGCCCAA 
GACCTACGTG 
CCTCAAGGTC 
TTTCCTGGAC 
CAAGGAAACA 
GAGGAATCGG 
CCCATCCCTG 
TTTCCTCCTG 
AGATGCCTGC 
CAGATTAACA 
GACAACTCTG 
TCCTCCACTA 
TTATGTAAAG 
TTTTACCCTC 
AGCATGAACA 
GTGT AC CCAC 
TATT CATAAA 
TATAAACTGC 
CCTGCATCCA 
ACTTCATAAC 
ACTGATTTTT 
AAAAATAAAG 



480 
540 
600 
660 
720 
780 



Hs. 8212 8; Probeset 
sequence: 85-1347 (start 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



BCA7 Protein sequence (SEQ ID NO:4) 

Gene name: 5T4 oncofetal trophoblast glycoprotein; Unigene number: Hs . 82128; Probeset 
Accession #: Z29083; Protein Accession #: NP_006661; Predicted Signal sequence: 1-32; 
Predicted TM domains: 357-373; PFAM domains: leucine_rich_repeats : 61-90, 119-142, 143-166, 
235-258, 259-282, 294-345; 

Summary: a type la TM protein of unknown function, detected in multiple cancers, with highest 
expression in breast cancer. 



MPGGCSRGPA 
QCPALCECSE 
AALNLSGSRL 
NHIVPPEDER 
LSNNSLVSLT 
HMADMVTWLK 
IVLALIGAIF 



AGDGRLRLAR 
AARTVKCVNR 
DEVRAGAFEH 
QNRS FEGMW 
YVSFRNLTHL 
ETEWQGKDR 
LLVLYLNRKG 



LALVLLGWS 
NLTEVPTDLP 
LPSLRQLDLS 
AALLAGRALQ 
ESLHLEDNAL 
LTCAYPEKMR 
IKKWMHNIRD 



SSSPTSSASS 
AYVRNLFLTG 
HNPLADLSPF 
GLRRLELASN 
KVLHNGTLAE 
NRVLLELNSA 
ACRDHMEGYH 



FSSSAPFLAS 
NQLAVLPAGA 
AFSGSNASVS 
HFLYLPRDVL 
LQGLPHIRVF 
DLDCDPILPP 
YRYEINADPR 



AVSAQPPLPD 
FARRPPLAEL 
APSPLVELIL 
AQLPSLRHLD 
LDNNPWVCDC 
SLQTSYVFIjG 
LTNLSSNSDV 



60 
120 
180 
240 
300 
360 



BCX5 DNA sequence (SEQ ID NO: 5) 
Gene name: LNIR; Unigene number: 



Hs. 61460; Probeset Accession #: AA02 802 8; Nucleic Acid 



Accession #: AF160477; Coding sequence: 225-1757 (start and stop codons underlined) 



GGGGAGCTCG 
CGGGGCCGGG 
GCAGCTTCCT 
TCTCGGTGGA 
GAGCCGAGAT 
CAGGCCGGTG 
AGGACGCAAA 



GAGCTCCCGA 
GCTGGGGCTG 
GCCTTCTGGG 
ACTTCAGAAA 
GTGGGGGCCT 
CCCCGCGGGT 
ACTGCCCTGC 



TCACGGCTTC 
GGTCCCCTAG 
TCAGTTCCTT 
CGCTGGGCAG 
GAGGCCTGGC 
GAG CTGGAGA 
TTCTACCGAG 



TTGGGGGTAG 
TGAGACCCAA 
ATTCAAGTCT 
TCTGCCTTTC 
TGCTGCTGCT 
CCTCAGACGT 
GGGACTCCGG 



CTACGGCTGG 
GTGCGAGAGG 
GCAGCCGGCT 
AACCATGCCC 
GCTACTGCTG 
GGTAACTGTG 
CGAGCAAGTG 



GTGTGTAGAA 
CAAGAACTCT 
CCCAGGGAGA 
CTGTCCCTGG 
GC AT CATTT A 
GTGCTGGGCC 
GGGCAAGTGG 



60 
120 
180 
240 
300 
360 
420 
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CATGGGCTCG GGTGGACGCG GGCGAAGGCG CCCAGGAACT AGCGCTACTG CACTCCAAAT 4 80 

ACGGGCTTCA TGTGAGCCCG GCTTACGAGG GCCGCGTGGA GCAGCCGCCG CCCCCACGCA 54 0 

ACCCCCTGGA CGGCTCAGTG CTCCTGCGCA ACGCAGTGCA GGCGGATGAG GGCGAGTACG 600 

AGTGCCGGGT CAGC?..CCTTC CCCGCCGGCA GCTTCCAGGC GCGGCTGCGG CTCCGAGTGA 660 

TGGTGCCTCC CCTGCCCTCA CTGAATCCTG GTCCAGCACT AGAAGAGGGC CAGGGCCTGA 72 0 

CCCTGGCAGC CTCCTGCACA GCTGAGGGCA GCCCAGCCCC CAGCGTGACC TGGGACACGG 78 0 

AGGTCAAAGG CACAACGTCC AGCCGTTCCT TCAAGCACTC CCGCTCTGCT GCCGTCACCT 84 0 

CAGAGTTCCA CTTGGTGCCT AGCCGCAGCA TGAATGGGCA GCCACTGACT TGTGTGGTGT 90 0 

CCCATCCTGG CCTGCTCCAG GACCAAAGGA TCACCCACAT CCTCCACGTG TCCTTCCTTG 96 0 

CTGAGGCCTC TGTGAGGGGC CTTGAAGACC AAAATCTGTG GCACATTGGC AGAGAAGGAG 102 0 

CTATGCTCAA GTGCCTGAGT GAAGGGCAGC CCCCTCCCTC ATACAACTGG ACACGGCTGG 108 0 

ATGGGCCTCT GCCCAGTGGG GTACGAGTGG ATGGGGACAC TTTGGGCTTT CCCCCACTGA 114 0 

CCACTGAGCA CAGCGGCATC TACGTCTGCC ATGTCAGCAA TGAGTTCTCC TCAAGGGATT 1200 

CTCAGGTCAC TGTGGATGTT CTTGACCCCC AGGAAGACTC TGGGAAGCAG GTGGACCTAG 1260 

TGTCAGCCTC GGTGGTGGTG GTGGGTGTGA TCGCCGCACT CTTGTTCTGC CTTCTGGTGG 132 0 

TGGTGGTGGT GCTCATGTCC CGATAC CATC GGCGCAAGGC CCAGCAGATG ACC CAGAAAT 138 0 

ATGAGGAGGA GCTGACCCTG ACCAGGGAGA ACTCCATCCG GAGGCTGCAT TCCCATCACA 144 0 

CGGACCCCAG GAGCCAGCCG GAGGAGAGTG TAGGGCTGAG AG CCGAGGGC CACCCTGATA 150 0 

GTCTCAAGGA CAACAGTAGC TGCTCTGTGA TGAGTGAAGA GC CCGAGGGC CGCAGTTACT 156 0 

CCACGCTGAC CACGGTGAGG GAGATAGAAA CACAGACTGA ACTG CTGTCT CCAGGCTCTG 162 0 

GGCGGGCCGA GGAGGAGGAA GATCAGGATG AAGGCATCAA ACAGGCCATG AACCATTTTG 16 8 0 

TTCAGGAGAA TGGGACCCTA CGGGCCAAGC CCACGGGCAA TGGCATCTAC ATCAATGGGC 174 0 

GGGGACACCT GGTC TGA CCC AGGCCTGCCT CCCTTCCCTA GGCCTGGCTC CTTCTGTTGA 18 0 0 

CATGGGAGAT TTTAGCTCAT CTTGGGGGCC TCCTTAAACA CCCCCATTTC TTGCGGAAGA 186 0 

TGCTCCCCAT CCCACTGACT GCTTGACCTT TACCTCCAAC CCTTCTGTTC ATCGGGAGGG 192 0 

CTCCACCAAT TGAGTCTCTC CCACCATGCA TGCAGGTCAC TGTGTGTGTG CATGTGTGCC 198 0 

TGTGTGAGTG TTGACTGACT GTGTGTGTGT GGAGGGGTGA CTGTC CGTGG AGGGGTGACT 204 0 

GTGTCCGTGG TGTGTATTAT GCTGTCATAT CAGAGTCAAG TGAACTGTGG TGTATGTGCC 2100 

ACGGGATTTG AGTGGTTGCG TGGGCAACAC TGT CAGGGTT TGGCGTGTGT GTCATGTGGC 216 0 

TGTGTGTGAC CTCTGCCTGA AAAAGCAGGT ATTTTCT CAG ACCCCAGAGC AGTATTAATG 222 0 

ATGCAGAGGT TGGAGGAGAG AGGTGGAGAC TGTGGCT CAG ACCCAGGTGT GCGGGCATAG 22 80 

CTGGAGCTGG AATCTGCCTC CGGTGTGAGG GAACCTGTCT CCTACCACTT CGGAGC CATG 234 0 

GGGGCAAGTG TGAAGCAGCC AGTCCCTGGG TCAGCCAGAG GCTTGAACTG TTACAGAAGC 24 00 

CCTCTGCCCT CTGGTGGCCT CTGGGCCTGC TGCATGTACA TATTTTCTGT AAATATACAT 24 60 

GCGCCGGGAG CTTCTTGCAG GAATACTGCT CCGAATCACT TTTAATTTTT TTCTTTTTTT 252 0 

TTTCTTGCCC TTTC CATTAG TTGTATTTTT TATTTATTTT TATTTTTATT TTTTTTTAGA 2580 

GATGGAGTCT CACTATGTTG CTCAGGCTGG CCTTGAACTC CTGGGCTCAA GCAATCCTCC 264 0 

TGCCTCAGCC TCCCTAGTAG CTGGGACTTT AAGTGTACAC CACTGTGCCT GCTTTGAATC 2 70 0 

CTTTACGAAG AGAAAAAAAA AATTAAAGAA AGCCTTTAGA TTTATCCAAT GTTTACTACT 2760 

GGGATTGCTT AAAGTGAGGC CCCTCCAACA CCAGGGGGTT AATTCCTGTG ATTGTGAAAG 2 82 0 

GGGCTACTTC CAAGGCATCT TCATGCAGGC AGCCCCTTGG GAGGGCACCT GAGAGCTGGT 288 0 

AGAGTCTGAA ATTAGGGATG TGAGCCTCGT GGTTACTGAG TAAGGTAAAA TTGCATCCAC 294 0 

CATTGTTTGT GATACCTTAG GGAATTGCTT GGACCTGGTG ACAAGGGCTC CTGTTCAATA 3000 

GTGGTGTTGG GGAGAGAGAG AGCAGTGATT ATAGACCGAG AGAGTAGGAG TTGAGGTGAG 3060 

GTG AAGGAGG TGCTGGGGGT GAG AATGT CG CCTTTCCCCC TGGGTTTTGG AT C ACT AATT 3120 

CAAGGCTCTT CTGGATGTTT CTCTGGGTTG GGGCTGGAGT TCAATGAGGT TTATTTTTAG 3180 

CTGGCCCACC CAGATACACT CAGCCAGAAT ACCTAGATTT AGTACCCAAA CTCTTCTTAG 324 0 

TCTGAAATCT GCTGGATTTC TGGCCTAAGG GAGAGGCTCC CATCCTTCGT TCCCCAGCCA 330 0 

GCCTAGGACT TCGAATGTGG AGC CTGAAGA TCTAAGATCC TAACATGTAC ATTTTATGTA 33 6 0 

AATATGTGCA TATTTGTACA TAAAATGATA TTCTGTTTTT AAATAAACAG ACAAAACTTG 342 0 
TTCAAAAAAA AAAAAAAAAA AAAAAAAAA 

BCX5 Protein sequence (SEQ ID NO: 6) 

Gene name: LNIR; Unigene number: Hs.614 60; Probeset Accession #: AA028028; Protein 
Accession #: AF160477; Predicted Signal sequence: 1-26; Predicted TM domains : 355-371; PFAM 

domains: IgSF_domain: 45-129, 162-225, 263-317; Summary: A type la TM protein; is a member 
of the immunoglobulin superfamily. 

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGE LETSD W TWLGQDAKL PCFYRGDSGE 60 

QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 12 0 

DEGEYECRVS TFPAGSFQAR LRLRVMVP PL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 180 

VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCWSHPGL LQDQRITHIL 24 0 

HVS FLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDG P LP SGVRVDGDTL 30 0 

GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VWGVIAALL 36 0 

FCLLVWWL MS RYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 42 0 

EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 480 
AMNHFVQENG TLRAKPTGNG IYINGRGHLV 

mouse BCX5 Protein sequence (SEQ ID NO: 7) 

Gene name: mouse_LNIR; Unigene number: n/a; Probeset Accession #: BF168327; Protein 
Accession #: n/a; Predicted Signal sequence: 1-27; Predicted TM domains: 346-362; PFAM 
domains: IgSF_domains : 44 -126 , 166 -221 , 259-313 ; Summary: This is the mouse orthologue of human 
BCX5; it is a type la TM protein of unknown function. 

MPLSLGAEMW GPEAWLRLLF LASFTGQYSA GELETSDWT WLGQDAKLP CFYRGDPDEQ 60 

VGQVAWARVD PNEXYPGAGL LHSKYGLHVN PAYEDRVEQX XHETFRRSVL LRNAVQADEG 120 
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EYECRVSTFP SGSFQARMRL RVLVPPLPSL NPGPPLEEGQ ADVAASCTAE GSPAPSVTWD 180 

TEVKGTQSSR S FTHPRS AAV TSEFHLVPSR SMNGQPLTCV VSHPGLLQDR RITHTLQVAF 24 0 

LAEASVRGLE DQNLWQVGRE GATLKCLSEG QPPPKYNWTR LDGPLPSGVR VKGDTLGFPP 300 

LTTEHSGVYX CHVSNELSSR DSQVTVEVLD PEDPGKQVDL VSASVIIVGV IAALLFCLLV 3 60 

VVVVLMSRYU RRKAQQMTQK YEEELTLTRE NS I RRLHSHH SDPRSQPEES VGLRAEGHPD 4 20 

SLKDNSSCSV MSEEPEGRSY STLTTVREIE TQTELLSPGS GRTEEDDDQD EGIKQAMNHL 4 80 
CRKMGP 



BCZ6 DNA sequence (SEQ ID NO: 8) 

Gene name: IL-6 receptor beta chain (gpl30; oncostatin M receptor); Unigene number: 

Hs. 82065; Probeset Accession #: M57230 / AA406546; Nucleic Acid Accession #: NM_002184; 

Coding sequence: 256-3012 (start and stop codons underlined) 

GAGCAGCCAA AAGGCCCGCG GAGTCGCGCT GGGCCGCCCC GGCGCAGCTG AACCGGGGGC 60 

CGCGCCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGCGCC AAGGGGTTCG TGCGCTGTGG 120 

AGACGCGGAG GGTCGAGGCG GCGCGGCCTG AGTGAAACCC AATGGAAAAA GCATGACATT 180 

TAGAAGTAGA AGACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTGATTTGG 240 

AAATATCCGC GCAAG ATG TT GACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTTTATTTTC 3 00 

CTCACCACTG AATCTACAGG TGAACTTCTA GATCCATGTG GTT AT AT C AG TCCTGAATCT 3 60 

CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTTTGTG TGCTAAAGGA AAAATGTATG 4 20 

GATTATTTTC ATGTAAATGC TAATTACATT GTCTGGAAAA CAAACCATTT TACTATTCCT 4 80 

AAGGAGCAAT ATACTATCAT AAACAGAACA GCATCCAGTG TCACCTTTAC AGATATAGCT 540 

TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTGA ACAGAATGTT 6 00 
TATGGAATCA CAATAATTTC AGGCTTGCCT CCAGAAAAAC CTAAAAATTT GAGTTGCATT 6 60 
GTGAACGAGG GGAAGAAAAT GAGGTGTGAG TGGGATGGTG GAAGGGAAAC ACACTTGGAG 7 20 
ACAAACTTCA CTTTAAAATC TGAATGGGCA ACACACAAGT TTGCTGATTG CAAAGCAAAA 7 80 
CGTGACACCC CCACCTCATG CACTGTTGAT TATTCTACTG TGTATTTTGT CAACATTGAA 840 
GTCTGGGTAG AAGCAGAGAA TGCCCTTGGG AAGGTTACAT CAGATCATAT CAATTTTGAT 900 
CCTGTATATA AAGTGAAGCC CAATCCGCCA CATAATTTAT CAGTGATCAA CTCAGAGGAA 960 

CTGTCTAGTA TCTTAAAATT GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 

AAATATAACA TTCAATATAG GACCAAAGAT GCCTCAACTT GGAGCCAGAT TCCTCCTGAA 1080 

GACACAGCAT CCACCCGATC TTCATTCACT GTCCAAGACC TTAAACCTTT TACAGAATAT 1140 

GTGTTTAGGA TTCGCTGTAT GAAGGAAGAT GGTAAGGGAT ACTGGAGTGA CTGGAGTGAA 1200 

GAAGCAAGTG GGATCACCTA TGAAGATAGA CCATCTAAAG CACCAAGTTT CTGGTATAAA 12 60 

ATAGATCCAT CCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA GACATTGCCT 1320 

CCTTTTGAAG CCAATGGAAA AATCTTGGAT TATGAAGTGA CTCTCACAAG ATGGAAATCA 1380 

CATTTACAAA ATTACACAGT TAATGCCACA AAACTGACAG TAAATCTCAC AAATGATCGC 14 40 

TATCTAGCAA CCCTAACAGT AAGAAATCTT GTTGGCAAAT CAGATGCAGC TGTTTTAACT 1500 

ATCCCTGCCT GTGACTTTCA AGCTACTCAC CCTGTAATGG AT CTTAAAGC ATT CCCCAAA 1560 

GATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1620 

GAGTGGTGTG TGTTATCAGA TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGATGGT 1680 

ACCGTGCATC GCACCTATTT AAGAGGGAAC TTAGCAGAGA GCAAATGCTA TTTGATAACA 1740 

GTTACTCCAG TATATGCTGA TGGAC CAGG A AGCCCTGAAT C C AT AAAGGC ATACCTTAAA 1800 

CAAGCTCCAC CTTC CAAAGG ACCTACTGTT CGGACAAAAA AAGTAGGGAA AAACGAAGCT 1860 

GTCTTAGAGT GGGACCAACT TCCTGTTGAT GTT C AGAATG GATTTATCAG AAATTATACT 1920 

ATATTTTATA GAACCAT CAT TGGAAATGAA ACTGCTGTGA ATGTGGATTC TTCCCACACA 1980 

GAATATACAT TGTCCTCTTT GACTAGTGAC ACATTGTACA TGGTACGAAT GGCAGCATAC 2 040 

ACAGATGAAG GTGGGAAGGA TGGT CCAGAA TTCACTTTTA CTACCCCAAA GTTTGCTCAA 2100 

GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTG 2160 

GGAGTGCTGT TCTGCTTTAA TAAGCGAGAC CTAATTAAAA AACACATCTG GCCTAATGTT 2220 

CCAGATCCTT CAAAGAGTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 2280 

TTTAATTCAA AAGATCAAAT GTATTCAGAT GGCAATTTCA CTGATGTAAG TGTTGTGGAA 2340 

ATAGAAGCAA ATGACAAAAA GCCTTTTCCA GAAGATCTGA AATCATTGGA CCTGTTCAAA 24 00 

AAGGAAAAAA TTAATACTGA AGGACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 24 60 

TCTTCTAGGC CAAGCATTTC TAGCAGTGAT GAAAATGAAT CTTCACAAAA CACTTCGAGC 2 520 

ACTGTCCAGT ATTCTACCGT GGTACACAGT GGCTACAGAC ACCAAGTTCC GTCAGTCCAA 25 80 

GTCTTCTCAA GATCCGAGTC TACCCAGCCC TTGTTAGATT CAGAGGAGCG GCCAGAAGAT 2640 

CTACAATTAG TAGATCATGT AGATGGCGGT GATGGTATTT TGCCCAGGCA ACAGTACTTC 2700 

AAACAGAACT GCAGTCAGCA TGAATCCAGT CCAGATATTT CACATTTTGA AAGGTCAAAG 2760 

CAAGTTTCAT CAGTCAATGA GGAAGATTTT GTTAGACTTA AACAGCAGAT TTCAGATCAT 2 820 

ATTTCACAAT CCTGTGGATC TGGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2 880 

GCTTTTGGTC CAGGTACTGA GGGACAAGTA GAAAGATTTG AAACAGTTGG CATGGAGGCT 2 940 

GCGACTGATG AAGGCATGCC TAAAAGTTAC TTACCACAGA CTGTACGGCA AGGCGGCTAC 3 000 

ATGCCTCAG T GAA GGACTAG TAGTTCCTGC TACAACTTCA GCAGTAC CT A TAAAGTAAAG 3 060 
CTAAAATGAT TTTATCTGTG AATTC 



BCZ6 Protein sequence (SEQ ID NO: 9) 

Gene name: IL-6 receptor beta chain (gpl30; oncostatin M receptor); Unigene number: 

Hs. 82065; Probeset Accession #: M57230 / AA406546; Protein Accession #: NP_002175; Predicted 

Signal sequence: 1-22; Predicted TM domains: 625-641; PFAM domains: 

fibronectin_type_III_domains: 222-311, 424-509, 519-606; Summary: A type I TM protein; it 
homodimerizes or heterodimerized to make a functional receptor for IL-6, oncostatin-M, IL-11, 
LIF, and CNTF. 

MLTLQTWWQ ALFIFLTTES TGELLDPCGY ISPESPWQL HSNFTAVCVL KEKCMDYFHV 60 
N ANY I VWKTN HFTIPKEQYT IINRTASSVT FTDIASLNIQ LTCNILTFGQ LEQNVYGITI 120 
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ISGLPPEKPK NLSCIVNEGK KMRCEWDGGR ETHLETNFTL KSEWATHKFA DCKAKRDTPT 180 

SCTVDYSTVY FVNIEVWVEA ENALGKVTSD HINFDPVYKV KPNPPHNLSV INSEELSSIL 24 0 

KLTWTNPSIK SVIILKYNIQ YRTKDASTWS QIPPEDTAST RSSFTVQDLK PFTEYVFRIR 300 

CMKEDGKGYW SDWSESASGI TYEDRPSKAP SFWYKIDPSH TOGYRTVQLV WKTLPPFEAN 360 

GKILDYEVTL TRWKSHLQNY TVNATKLTVN LTNDRYLATL TVRNLVGKSD AAVLTIPACD 420 

FQATH PVMDL KAFPKDNMLW VEWTTPRESV KKYILEWCVL SDKAPCITDW QQEDGTVHRT 480 

YLRGNLAESK CYLITVTPVY ADGPGSPESI KAYLKQAPPS KGPTVRTKKV GKNEAVLEWD 54 0 

QLPVDVQNGF IRNYTIFYRT IIGNETAVNV DSSHTEYTLS SLTSDTLYMV RMAAYTD EGG 600 

KDGPEFTFTT PKFAQGEIEA IWPVCLAFL LTTLLGVLFC FNKRDLIKKH IWPNVPDPSK 660 

SHIAQWSPHT P PRHNFNSKD QMYSDGNFTD VSWEIEAND KKPFPEDLKS LDLFKKEKIN 72 0 

TEGHSSGIGG SSCMSSSRPS ISSSDENESS QNTSSTVQYS TWHSGYRHQ VPSVQVFSRS 78 0 

ESTQPLLDSE ERPSDLQLVD HVDGGDG I L P RQQYFKQNCS QHESSPDISH FERSKQVSSV 84 0 

NEED F VRLKQ QISDHISQSC GSGQMKMFQE VSAADAFGPG TEGQVERFET VGMEAATDEG 900 
MPKSYLPQTV RQGGYMPQ 

BFG4 DNA sequence (SEQ ID NO: 10) 

Gene name: KIAA0882 protein; Unigene number: Hs. 90419; Probeset Accession #: Z39762 
Nucleic Acid Accession #: AB020689; Coding sequence: 108-2777 (start and stop codons 
underlined) 

GAACTTATGT AGCCTCATTA TCCCGCTCCG TGAGGTGACA ATTGTGGAAA AGGCAGACAG 60 

CTCCAGTGTG CTCCCCAGTC CCTTATCACA TCAGCACCCG AAACAGG ATG ACCTTCCTAT 12 0 

TTGCCAACTT GAAAGATAGA GACTTT CTAG TGCAGAGGAT CTCAGATTTC CTGCAACAGA 180 

CTACTTCCAA AATATATTCT GACAAGGAGT TTG CAGGAAG TTACAACAGT TCAGATGATG 24 0 

AGGTGT ACT C TCGACCCAGC AGCCTCGTCT CCTCCAGCCC CCAGAGAAGC ACGAGCTCTG 30 0 

ATGCTGATGG AGAGCGCCAG TTTAACCTAA ATGGCAACAG CGTCCCCACA GCCACACAGA 36 0 

CCCTGATGAC CATGTATCGG CGGCGGTCTC CCGAGGAGTT CAACCCGAAA TTGGCCAAAG 42 0 

AGTTTCTGAA AGAGCAAGCC TGGAAGATTC ACTTTGCTGA GTATGGGCAA GGGATCTGCA 480 

TGT AC CGCAC AGAGAAAACG CGGGAGCTGG TGTTGAAGGG CATCCCGGAG AGCATGCGTG 54 0 

GGGAGCTCTG GCTGCTGCTG TCAGGTGCCA TCAATGAGAA GGCCACACAT CCTGGGTACT 600 

ATGAAGACCT AGTGGAGAAG TCCATGGGGA AGTATAATCT CGCCACGGAG GAGATTGAGA 66 0 

GGGATTTACA CCGCTCCCTT CCAGAACACC CAGCTTTTCA GAATGAAATG GGCATTGCTG 72 0 

CACTAAGGAG AGTCTTAACA GCTTATGCTT TTCGAAATCC CAACATAGGG TATTGCCAGG 78 0 

CCATGAATAT TGTCACTTCA GTGCTG CTGC TTTATGCCAA AGAGGAGGAA GCTTTCTGGC 84 0 

TGCTTGTGGC TTTGTGTGAG CGCATGCTCC CAGATTACTA CAACACCAGA GTTGTGGGTG 90 0 

CACTGGTGGA CCAAGGTGTC TTTGAGGAGC TAGCACGAGA CTACGTCCCA CAGCTGTACG 96 0 

ACTGCATGCA AGACCTGGGC GTGATTTCCA CCATCTCCCT GTCTTGGTTC CTCACACTAT 102 0 

TTCTCAGTGT GATGCCTTTT GAGAGTGCAG TTGTGGTTGT TGACTGTTTC TTCTATGAAG 108 0 

GAATTAAAGT GATATT CCAG TTGGCCCTAG CTGTGCTGGA TGCAAATGTG GACAAACTGT 114 0 

TGAACTGCAA GGATGATGGG GAGGCCATGA CCGTTTTGGG AAGGTATTTA GACAGTGTGA 120 0 

CCAATAAAGA CAGCACACTG CCTCCCATTC CTCACCTCCA CTCCTTGCTC AGCGATGATG 126 0 

TGGAACCTTA CCCTGAGGTA GACATCTTTA GACTCATCAG AACTTCCTAC GAGAAATTCG 132 0 

GAACTAT CCG GGCAGATTTG ATTGAACAGA TGAGATTCAA ACAGAGACTG AAAGTGATCC 13 8 0 

AGACGCTGGA GGATACTACG AAACGCAACG TGGTACGAAC CATTGTGACA GAAACTTCCT 144 0 

TTACCATTGA TGAGCTGGAA GAACTTTATG CTCTTTTCAA GGCAGAACAT CTCACCAGCT 150 0 

GCTACTGGGG CGGGAGCAGC AACGCGCTGG ACCGGCATGA CCCCAGCCTG CCCTACCTGG 156 0 

AACAGTATCG CATTGACTTC GAGCAGTTCA AGGGAATGTT TGCTCTTCTC TTTC CTTGGG 162 0 

CATGTGGAAC TCACTCTGAC GTTCTGGCCT CCCGCTTGTT CCAGTTATTA GATGAAAATG 16 8 0 

GAGACTCTTT GATTAACTTC CGGGAGTTTG TCTCTGGGCT AAGTGCTGCA TGCCATGGGG 174 0 

ACCTCACAGA GAAGCTCAAA CTCCTGTACA AAATGCACGT CTTGCCTGAG CCATCCTCTG 180 0 

ATCAAGATGA ACCAGATTCT GCTTTTGAAG CAACTCAGTA CTTCTTTGAA GATATTACCC 186 0 

CAGAATGTAC ACATGTTGTT GGATTGGATA GCAGAAGCAA ACAGGGTGCA GATGATGGCT 192 0 

TTGTTACGGT GAGCCTAAAG CCAGACAAAG GGAAGAGAGC AAATTCCCAA GAAAATCGTA 198 0 

ATTATTTGAG ACTGTGGACT C CAGAAAAT A AATCTAAGTC AAAGAATGCA AAGGATTTAC 2 04 0 

CCAAATTAAA TCAGGGG C AG TTCATTGAAC TGTGTAAGAC AATGTATAAC ATGTTCAGCG 2100 

AAGACCCCAA TGAGCAGGAG CTGTACCATG CCACGGCAGC AGTGACCAGC CTCCTGCTGG 216 0 

AGATTGGGGA GGTCGGCAAG TTGTTCGTGG CCCAGCCTGC AAAGGAGGGC GGGAGCGGAG 222 0 

GCAGTGGGCC GTCCTGCCAC CAGGGCATCC CAGGCGTGCT CTTCCCCAAG AAAGGG C C AG 22 8 0 

GCCAGCCTTA CGTGGTGGAG TCTGTTGAGC CCCTGCCGGC CAGCCTGGCC CCCGACAGCG 234 0 

AGGAACACTC CCTTGGAGGA CAAATGGAGG ACATCAAGCT GGAGGACTCC TCGCCCCGGG 24 0 0 

ACAACGGGGC CTGCTCCTCC ATG CTGATCT CTGACGACGA CACCAAGGAC G AC AG CT CC A 246 0 

TGTCCTCATA CTCGGTGCTG AGTGCCGGCT CCCACGAGGA GGACAAGCTG CACTG CGAGG 2 52 0 

AAATCGGAGA GGACACGGTC CTGGTGCGGA GCGGCCAGGG CACGGCGGCA CTGCCCCGGA 2 580 

GCACCAGCCT GGACCGGGAC TGGGCCATCA CCTTCGAGCA GTTCCTGGCC TCCCTCTTAA 264 0 

CTGAGCCTGC CCTGGTCAAG TACTTTGACA AGCCCGTGTG CATGATGGCC AGGATTACCA 2 70 0 

GTGCAAAAAA CATCCGGATG ATGGGCAAGC CCCTCACCTC GGCCAGTGAdf TATGAAATCT 2 760 

CGGCCATGTC CGGC TGA CAC GGGCGCCTTC CCGGGGGAGT GGGAGGAGAG GGAGGGGAGG 2 82 0 

GATTTTTTAT GTTCTTCTGT GTTGAGTTTT TTCTTTCTTT CTTTTAAATT AAATATTTAT 2 880 

TAGTACCTGG AATTGAAGCC TAGTGTTTTC ATAATGTAAT TCAATGAAAA CTGTTGGAGA 2 94 0 

AATATTTAAA CACCTCAATG TAGGTACATT ACACTCTTGT TGCGGGGAGG GGATTTACCA 3000 

GAATACAGTT TATTTCGTGA ATTCTAAAAA ACAAAAAGAT GAATCTGTCA GTGATATGTG 3 060 

TGTATTATAA CTTATTAATC TTGCTGTTGA GCTGTATACA TGGTTTAAAA AATAGTACTG 3120 

TTTAATGCTA AGTAAGGCAG CAGTCATTTG TGTATTCAGG CTTTTTAAAT AAAATTAGAG 3180 

CTGTAAGGAA AATGAAAAGC CACAAATGCA AGACTGTTCT TAAATGGAAG GCATAGTCAG 3 24 0 

CGAGGGTAAA TCCTATACCA CTTTAGGAAG TATTAAAAAT ATTTTTAAGA TTTGAAATAT 3 3 00 

ATTTCATAGA AGTCCTCTAT TCAAAATCAT ATTCCACAGA TGTTCCCCTT CAAAGGGAAA 3 360 
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ACATTTGGGG TTCTAAACAG TTATGAAAGT AAGTGATTTT TACATGATTC CAGAATAACA 342 0 

CTTGTATTGA CCAATTTAGA CAGATACCAG ACCAATTTTG CATTTAAGAA ATTGTTCTGA 34 80 

TTATTTACGT CAACTCATTA GAATTCAGTG AAAAGTAACA GTCTTTTGTC ACAGAGAATC 354 0 

TGAAAGTAGC AGCAAAGACA GAGGGCTCAT GACAGGTTTT TGCTTTTGCT TTGCTTTTGT 3600 

TTTTGAAAGA GTAAAAGTAC TGATGCTTCT GATACTGGAT GTTTAGCTTC TTACTGCAAA 3660 

AACATAAGTA AAACAGTCAA CTTTACCATT TCCGTATTCT CCATAGATTG AAGAAATTTA 3720 

TACCACATAT CGCATATGAC CATCTTTCCA TCAAATCAAT GTAGAGATAA TGTAAACTGA 3780 

AAAAAAATCT GCAAGATAAT GTAACTGAAT GTTTTAAAAA CAGAACTTGT CACTTTATAT 3 840 

AAAAGAATAG TATGCTCTAT TTCCTGAATG GATGTGGAAA TGAAAGCTAG CGCACCTGCA 3900 

CTTTGAATTC TTGCTTCTTT TTTATTACTG TTATGATTTT GCTTTTTACA GATGTTGGAC 3960 

GATTTTTTCT TCTGATTGTT GAATTCATAA TCATGGTCTC ATTT CCTTTG CTTCTTTGGA 4020 

ATATTTCTTT CAACAC ATT C CTTTATTTTA TTATACATTG TGTCCTTTTT TTAGCTATTG 4080 

CTGCTGTTGT TTTTTATTCT ATTTACAGGA TGATTTTTAA ACTGTCAAAT GAAGTAGTGT 414 0 

TAACCTCAAA TAGGCTAAAT GTGAACAAAT AAAATACAGC AAATACTCAG AAAAAAAAAA 42 00 
AAAAAAAAAA AAAAA 

BFG4 Protein sequence (SEQ ID NO: 11) 

Gene name: KIAA0882 protein; Unigene number: Hs. 90419; Probeset Accession #: Z39762; 

Predicted TM domains: 302-318; PFAM 
ane protein, likely localized to the 



SSLVSSSPQR 60 

AWKIHFAEYG 120 

KSMGKYNLAT 180 

SVLLLYAKEE 24 0 

GVISTISLSW 300 

GEAMTVLGRY 360 

LIEQMRFKQR 4 20 

SNALDRHD PS 4 80 

FREFVSGLSA 540 

VGLDSRSKQG 600 

QFIELCKTMY 660 

HQGI PGVLFP 720 

SMLISDDDTK 780 

DWAITFEQFL 840 



Protein Accession #: BAA74905; Signal sequence: none; 
domains: TBC_domain: 135-347; Summary: a Type II memb 
peroxisome . 

MTFLFANLKD RDFLVQRISD FLiQQTTSKIY SDKEFAGSYN SSDDEVYSRP 
STSSDADGER QFNLNGNSVP TATQTLMTMY RRRSPEEFNP KLAKEFLKEQ 
QGICMYRTEK TRELVLKGIP ESMRGELWLL LSGAINEKAT HPGYYEDLVE 
EEIERDLHRS LPEHPAFQNE MGIAALRRVL TAYAFRNPNI GYCQAMNIVT 
EAFWLLVALC ERMLPDYYNT RWGALVD QG VFEELARDYV PQLYDCMQDL 
FLTLFLSVMP FESAWWDC FFYEGIKVIF QLALAVLDAN VDKLLNCKDD 
LDSVTNKDST LPPIPHLHSL LSDDVEPYPE VDIFRLIRTS YEKFGTIRAD 
LKVIQTLEDT TKRNWRTIV TETSFTIDEL EELYALFKAE HLTSCYWGGS 
LPYLEQYRID FEQFKGMFAL LFPWACGTHS DVLASRLFQL LDENGDSLIN 
ACHGDLTEKL KLLYKMHVLP EPSSDQDEPD SAFEATQYFF EDITPECTHV 
ADDGFVTVSL KPDKGKRANS QENRNYLRLW TPENKSKSKN AKDLPKLNQG 
NMFSEDPNEQ ELYHATAAVT SLLLEIGEVG KLFVAQ PAKE GGSGGSGPSC 
KKGPGQPYW ESVEPLPASL APDSEEHSLG GQMEDIKLED SSPRDNGACS 
DDSSMSSYSV LSAGSHEEDK LHCEEIGEDT VLVRSGQGTA ALPRSTSLDR 
ASLLTEPALV KYFDKPVCMM ARITSAKNIR MMGKPLTSAS DYEISAMSG 



BCU7 DNA sequence (SEQ ID NO: 12) 

Gene name: EST; Unigene number: Hs. 9855 8; Probeset Accession #: AA42 8062; Nucleic Acid 
Accession #: n/a; Coding sequence: 1-573 (stop codon underlined) 



TATTTTATTT 
TTTTTCACAG 
CAGAAATGGT 
TTTTCATGTG 
CGATCCTCTC 
CAATATAGGT 
AAACTGATGG 
CTGGTTATGA 
TTATTGCTTC 
TTGCCCTACA 
TTCATTTGAA 
ATTTACACAG 
GAAACCAACT 
TGTTGATGCC 
AAGGAAAAAT 
TCTG CAGTAT 
TTAGGAAGGA 
ATAAAAATTT 
CATG CAAAAT 
AAAAACTTAG 
CACATATCCT 
GAACTGT CAG 
TGGCTTTAGT 
ATTCAGAAAT 
ACTCTGTTGT 
TTCTTTGGGA 
ATATCACCTA 
TCTGGTGGTT 
GTACCATAAA 
AC CT AAAATG 
GGCACCCACT 
TGCTTTTTTT 
TTTATAAACA 
AGTGATCATT 



TCCAGGCTAA 
CATGCAACAA 
CTGGGTATAA 
GAAAGGTGTC 
TGTGGTGTTA 
TTTGGGGCTT 
ATTTCATTAT 
CAGATAAGTT 
TGTTCAGTTG 
AATTTCATTT 
AGTTCCATGC 
AATATTTGAA 
T ATCTG CAT A 
ATGCTTATCA 
AATGCTTAGA 
CGAAGGATGC 
TAAAGTTACA 
GTCATTATTT 
AATTTTTTAA 
AAGGTTTAAC 
TGCTATAAGT 
TATCTATTCT 
AATGCTCCTT 
AT C AATAAAA 
TTTAATT CAA 
GTTATTTTCT 
AACTGGTTAG 
ATGAAGACAA 
AATTTGTTGT 
TTTGATAATA 
TTCCTAGTAA 
TTTCTGAATT 
CAATTAGAAT 
TATTTTTATT 



AG CAAATGAA 
TGGTGCTAGG 
GCTATTTCAT 
ATTCAAAAAA 
ATTTTTTTAT 
CCATATCATC 
AGAATTATCT 
TGCTCAAAAA 
TCTAAGTATC 
GGCAGCGCCA 
AGCTTTAGCA 
TTGAAACAAT 
ATTAAATCTA 
AATACATGCA 
CTTTGGTGTA 
AAATGTTGAC 
TTTGTCTTAA 
ATGCTGGTGA 
ATTATGTTAT 
ACTTCACTGA 
TTTGAAGTTT 
GGTGCTAAAT 
TTATTCATTG 
TCCTATGTTA 
CAGTCCAACA 
TTTTAAAATC 
ATTACTTCTA 
ATTCTTAATG 
GCAATATTAG 
TTAACATGTA 
TGTTTTCCAT 
AATTAGGTAT 
TACAATTAAT 
TAGCACAGGT 



AGTTTGCTGG 
ATAGCTATTT 
AAAAGCAGCT 
AAAGTAATTG 
ATGACCAGTA 
AAAAGACTGA 
GTGAGTTGTG 
ATGTGGATGA 
ATCCCTTCTG 
TAAC AT T CAT 
CAGAGTTGAC 
AGAAATTTTT 
ATACATATTT 
CAAGCTAAAC 
GGTTCTTCCT 
ATAGATGGAA 
TTTCTAACAT 
AACGTATAAT 
TGTTTAAATT 
TTAATGGTGC 
CTTAGCAATT 
GTATGGTGCT 
CTAAATTTAG 
AATTAAT CTT 
TTATTTAGGT 
TTTTTATAGC 
CAGCTAATAA 
G CTACTTG AC 
AATTATCATA 
TCTAAGAGGA 
GATTTTCCAG 
TGGTAAAATA 
TAACAGAGGT 
CATAAGAAAA 



TATCAACACA 
CTTACTGTAA 
TTAAATTGTC 
G CAT ACATAT 
GAAAAATTTT 
AAAATTATAA 
TAGACACAGT 
AGCCATTATT 
TGGC CCATCA 
TTAAAAAGTT 
CAAACACTGG 
CTCATAATAT 
AAGCCAGTTT 
ATAATTTGAA 
GTGTAGCCAT 
GCTCTTACCT 
TATCTTTGCT 
CACATCCAAT 
TGACTTATGG 
TGAAAACACG 
AAAGTTTTTT 
AAATGAATTG 
TGTTATC C AT 
TACCAAAAAC 
GTTACAGAGT 
TTGGCAATGT 
TATTGCAGGC 
CTACAGCAAA 
TGTTTCCTAC 
AAAAAGAGTT 
TTCTGAGGCA 
TATTTTTAAA 
ATAATTGTCT 
ATATATAGAA 



GCCTGCCATA 
TTGCCAGAGG 
AGTATTAAGG 
TCCACATCAT 
AATATTCTCA 
TTTTAGAATT 
CTTAATGTTT 
GTTATTATTG 
CGCAGCAGAG 
TATGAAAACA 
CGTAAGTTCA 
ATACCTATGT 
AAGTGCTTTG 
TGGGTCTATG 
ATACCCAGGC 
ACCAAAGTGT 
TTTATGTTTC 
TATTTGAACA 
GAGATCAGTC 
TTACAATTAC 
TATT CAGTGT 
TTAGTGTTGA 
TTGATTCCTG 
AGG CAAGTT A 
GTAAATATAT 
CCAAAGTCAA 
ACTGGCGCCC 
AGCCATTTCT 
ATCTGACAGC 
AATATATT CT 
CTTATTAAAG 
TTTAGTTAGC 
CACTTTCAGA 
AAATAATCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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TTTCATATAT AAAAGGATTA TTTCTCCACC TTTAATTATT GGCCTATCAT TTGTTAGTGT 2100 

TATTTGGTCA TATTATTGAA CTAATGTATT ATTCCATTCA AAGTCTTTCT AGATTTAAAA 2160 

ATGTATGCAA AAGCTTAGGA TTATATCATG TGTAACTATT ATAGATAACA TCCTAAACCT 2220 

TCAGTTTAGA TATATAATTG ACTGGGTGTA ATCTCTTTTG TAATCTGTTT TGACAGATTT 2280 

CTTAAATTAT GTTAGCATAA TCAAGGAAGA TTTACCTTGA AGCACTTTCC AAATTGATAC 234 0 

TTTCAAACTT ATTTTAAAGC AGTAGAACCT TTTCTATGAA CTAAATCACA TGCAAAACTC 2400 

CAACCTGTAG TATACATAAA ATGGACTTAC TTATTCCTCT CACCTTCTCC AGTGCCTAGG 2460 

AATATTCTTC TCTGAGCCCT AGGATTGATT CTATCACACA GAGCAACATT AATCTAAATG 252 0 

GTTTAGCTCC CTCTTTTTTC TCTAAAAACA ATCAGCTAAT AAAAAAAAAA TTTGAGGGCC 2580 

TAAATTATTT CAATGGTTGT TTGAAATATT CAGTTCAGTT TGTACCTGTT AGCAGTCTTT 264 0 

CAGTTTGGGG GAGAATTAAA TACTGTGCTA AGCTGGTGCT TGGATACATA TTACAGCATC 2700 

TTGTGTTTTA TTTGACAAAC AGAATTTTGG TGCCATAATA TTTTGAGAAT TAGAGAAGAT 2760 

TGTGATGCAT ATATATAAAC ACTATTTTTA AAAAATATCT AAATATGTCT CACATATTTA 282 0 

TATAATCCTC AAATATACTG TACCATTTTA GATATTTTTT AAACAGATTA ATTTGGAGAA 2880 

GTTTTATTCA TTACCTAATT CTGTGGCAAA AATGGTGCCT CTGATGTTGT GATATAGTAT 2 94 0 

TGTCAGTGTG TACATATATA AAACCTGTGT AAACCTCTGT CCTTATGAAC CATAACAAAT 3000 

GTAGCTTTTT AAAGTCCATT GTATTGTTTT TTCTTTCAAT AAAAGAGTAT AATTAATTGG 3060 
TTGTTTTTGA 



BCU7 Protein sequence (SEQ ID NO: 13) 

Gene name: EST; Unigene number: Hs . 98558; Probeset Accession #: AA428062; Protein Accession 
#: n/a; Signal sequence: none; Predicted TM domains: 125-141, 154-170; PFAM domains: none; 
Summary: A type III membrane protein, highly overexpressed in breast cancer and prostate 
cancer; unknown function. 

YF I FQAKANE SLLVSTQPAI F FT ACNNGAR IAISYCNCQR QKWSGYKLFH KSSFKLSVLR 6 0 

FSCGKVSFKK KVIGIHIPHH RSSLWCXFFY MTSRKILIFS QYRFWGFHII KRLKNYNFRI 12 0 

KLMDFIIELS VSCVDTVLMF LVMTDKFAQK MWMKPLLLLL LLLLFSCLSI IPSVAHHAAE 180 
LPYKFHLAAP 

BFA1 DNA sequence (SEQ ID NO: 14) 

Gene name: calsyntenin-2 ; Unigene number: Hs.7413; Probeset Accession #: R46025; Nucleic 

Acid Accession #: NM_022131; Coding sequence: 11-2878 (start and stop codons underlined) 

TGCTGCGAGG ATG CTGCCTG GGCGGCTGTG CTGGGTGCCG CTCCTGCTGG CGCTGGGCGT 60 

GGGGAGCGGC AGCGGCGGTG GCGGGGACAG CCGGCAGCGC CGCCTCCTCG CGGCTAAAGT 120 

CAATAAGCAC AAGC CATGGA TCGAGACTTC ATATCATGGA GTCATAACTG AGAACAATGA 18 0 

CACAGTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGG TTCCTTTTGC 24 0 

AGGGGAAATC TGTGCGTTCA AGATCCATGG CCAGGAGCTG CCCTTTGAGG CTGTGGTGCT 300 

CAACAAGACA TCAGGAGAGG GCCGGCTCCG TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360 

GAAGGAGTAC ACATTCATCA TCCAGGCCTA TGACTGTGGT GCTGGGCCCC ACGAGACAGC 420 

CTGGAAAAAG TCACACAAGG CCGTGGTCCA TATACAGGTG AAGGATGTCA ACGAGTTTGC 480 

TCCCACCTTC AAAGAGCCAG CCTACAAGGC TGTTGTGACG GAGGGCAAGA TCTATGACAG 54 0 

CATTCTGCAG GTGGAGGCCA TTGACGAGGA CTGCTCCCCA CAGTACAGCC AGATCTGCAA 600 

CTATGAAATC GTCACCACAG ATGTGCCTTT TGCCATCGAC AGAAATGGCA ACATCAGGAA 660 

CACTGAGAAG CTGAGCTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 

CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 

CAAGCCTGGC TGGCAAGACT GGACCAAGAG GATTGAGTAC CAGCCTGGCT CCGGGAGCAT 84 0 

GCCCCTGTTC CCCAGCATCC ACCTGGAGAC GTGCGATGGA GCCGTGTCTT CCCTCCAGAT 900 

CGTCACAGAG CTG CAGACT A ATTACATTGG GAAGGGTTGT GACCGGGAGA CCTACTCTGA 960 

GAAATCCCTT CAGAAGTTAT GTGGAGCCTC CTCTGGCATC ATTGACCTCT TGCCATCCCC 1020 

TAGCGCTGCC ACCAACTGGA CTG CAGGACT GCTGGTGGAC AG CAGTGAGA TGATCTTCAA 108 0 

GTTTGACGGC AGG CAGGGTG CCAAAATCCC CGATGGGATT GTGCCCAAGA ACCTGACCGA 114 0 

TCAGTTCACC ATCACCATGT GGATGAAACA CGGCCCCAGC CCTGGTGTGA GAGCCGAGAA 1200 

GGAAACCATC CTCTGCAACT CAGACAAAAC CGAAATGAAC CGGCAT C ACT ATGCCCTGTA 1260 

TGTGCACAAC TGCCGCCTCG TCTTTCTCTT GCGGAAGGAC TTCGACCAGG CTGACACCTT 132 0 

TCGCCCCGCG GAGTTCCACT GGAAGCTGGA TCAGATTTGT GACAAAGAGT GGCACTACTA 1380 

TGTCATCAAT GTGGAGTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 144 0 

ATACCTGGTG ACCAACGACT GGCCCATTCA TCCATCTCAC ATAGCCATGC AACTCACAGT 1500 

CGGCGCTTGT TGGCAAGGAG GAGAAGTCAC CAAAC CAC AG TTTGCTCAGT TCTTTCATGG 1560 

AAGCCTGGCC AGTCTCACCA TCCGCCCTGG CAAAATGGAA AGCCAGAAGG TGATCTCCTG 1620 

CCTGCAGGCC TGCAAGGAAG GGCTGGACAT TAATTCCTTG GAAAGCCTTG GCCAAGGAAT 1680 

AAAGTATCAC TTCAACCCCT CGCAGTCCAT CCTGGTGATG GAAGGTGACG ACATTGGGAA 174 0 

CATTAACCGT GCTCTCCAGA AAGTCTCCTA CATCAACTCC AGGCAGTTCC CAACGGCGGG 1800 

TGTGCGGCGC CTCAAAGTAT CCTCCAAAGT CCAGTGCTTT GGGGAAGACG TATGCATCAG 1860 

TATCCCTGAG GTAGATGCCT ATGTGATGGT CCTCCAGGCC ATCGAGCCCC GGATCACCCT 1920 

CCGGGGCACA GACCACTTCT GGAGACCTGC TGCCCAGTTT GAAAGTGCCA GGGGAGTGAC 1980 

CCTCTTCCCT GATATCAAGA TTGTGAGCAC CTTCGCCAAA ACCGAAGCCC CCGGGGACGT 2 040 

GAAAACCACA GACCCCAAAT CAGAAGTCTT AGAGGAAATG CTTCATAACT TAGATTTCTG 210 0 

TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AG CT CAACCA 216 0 

CAGTGAGCTC CACCAACGAC ACCTGGATGC CACTAATTCT ACTGCAGGCT ACTCCATCTA 222 0 

CGGTGTGGGC TCCATGAGCC GCTATGAGCA GGTGCTACAT CACATCCGCT AC CGCAACTG 2280 

GCGTCCGGCT TCCCTTGAGG CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 234 0 

CTACACTAGC AATGAGTTCA ACTTGGAGGT CAGCATCCTT CATGAAG AC C AAGTCTCAGA 24 00 

TAAGGAGCAT GTCAATCATC TGATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460 

TGAGTCCCGG AGTAG CAT C C AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 
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CATCATCTCC GTGTGCATGC TTGTGTTTGT CGTGGCCATG GGTGTGTACC GGGTCCGGAT 2580 

CGCCCACCAG CACTTCATCC AGGAGACTGA GGCTGCCAAG GAATCTGAGA TGGACTGGGA 264 0 

CGATTCTGCG CTGACTATCA CAGTCAACCC CATGGAGAAA CATGAAGGAC CAGGGCATGG 2700 

GGAAGATGAG ACTGAGGGAG AAGAGGAGGA AGAAG CCGAG GAAGAAATGA GCTCCAGCAG 2760 

TGGCTCTGAC GACAGCGAAG AGGAGGAGGA GGAGGAAGGG ATGGG CAGAG GCAGACATGG 2820 

GCAGAATGGA GCCAGGCAAG CCCAGCTGGA GTGGGATGAC TCCACCCTCC CCTAC TAG TG 2880 

CCCAGGGGTC TGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTATGCC 294 0 

CATGTCTATC ATACCTCACC TCTGATGTCT GTGACATGTC TGGGAAGGCC TTCTCCAGCT 3000 

TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060 

AAGAAGCCCA GCATGG CC AT CAGTGAGGAC TTCAGGGTAG ACTTTGTCCT GTAGCCTCCA 3120 

CTTCTGCCCT AAGTTCCCCA GCATCCTGAC TACCTGTCTG CAGAGTTTGC CTTTGTTTTT 318 0 

TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 324 0 

GGCCCTGGGG TTCCAACTCA CTGTGCGTCT CCTCCACACA GACCAGTAGG TTCTCCTATG 3300 

CTGACTCCAG GTTGCTT CAT ACAAGGAGGG TGGTTGAACT TCACACACGT AAGGTCTTAG 3360 

TGCTTAACAG TTTAAAGGAA AGTCCTTGTT GAGGCAGAAC TAAGTTTACA GGGAAAGGTA 342 0 

CACACATTCT CTCTCTCTCT CTCTCTCTGT CTATCTAGTT CCCCAGCTTG GAGAGC CTTT 3480 

CCCCTTGCTT CTTTCTGAGG CCATATAAGC TTATAAGAAA AGTCCCAAAC CAAGAATAGG 354 0 

TCCTTGGCCA CAAGCAGGGT CTGATCCCCC AT CAGAG CTA TCTGAGCCTG CCTGTCTGGG 3600 

CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660 

CCAGGAGGCA TTCCACACAG GCACTGCCCC AGGACAACAC AACAAGGACA GTCACAACAA 372 0 

GGACAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGCCAGA 3780 

AAGCAGATGG AAATGCTAAT GAGGTCAAAC GTAGGCTTCA TGGTGGGTGG AGTGGGGGTG 3 84 0 

GCTGGGCTCC CCCAGGACAG AGGGGACCCT GAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3 900 

TATGGTCCCT TATCTCCTAT CTTCCCTCTT GAGAAAATAC ACGCTTTCTG CATGTATTAG 3 960 

AAACGCACGA GCTCCACCAA GTCTACAATG AAAGTTTGAA ATTTAACTGC AAGGAATTAG 4 020 

AAGCATATTT GCAATCATTG CAGCTTCTTC TTTCTTCTGC TCATAAAAGG AGGAACACTT 4 080 

TAGATAGAGG GCAAATATAT CTGAAAACCT AATTTCTTTC TTTTTTTGAT AAGGAAATCT 414 0 

TTTCCATCTC CATCCTAACA TGCACAACCT GTGAAGAGAA TTGTTTCTAT AGTAACTGGT 42 00 

CTGTGATCTT TTGTGGCCAA GAGAATAGCA GGCAAGAATT AGGGCCTTGA CAGAATTTCC 426 0 

ACGAAGCTCT GAGAACATGT TTGTTTCGAA TGTCTGATTC CTCTTTGTCA TCAATGTGTA 432 0 

TGCTCTGTCC CCATCCTTCA CTCCTCCTCA AGCTCACACC AATTGGTTTG GCACAGG C AC 4380 
AGAGCTGGTC CCTAGTTAAG TGGCATTTAT GTTAAAAAAA A 



BFA1 Protein sequence (SEQ ID NO: 15) 

Gene name: calsyntenin-2 ; Unigene number: Hs.7413; Probeset Accession #: R46025; Protein 
Accession #: NP_071414; Predicted Signal sequence: 1-20; Predicted TM domains: 832-84 8; 
PFAM domains: cadherin_domains : 48-151, 165-254; Summary: A type I membrane protein; a 
member of the calsyntenin family; is related to the FAT tumor suppressor; is likely an 
adhesion molecule important in mammalian developmental processes and cell communication. 



MLPGRLCWVP LLliALGVGSG SGGGGDSRQR RLLAAKVNKH KPWIETSYHG VITENNDTVI 60 

LDPPLVALDK DAPVPFAGEI CAFKIHGQEL PFEAWLNKT SGEGRLRAKS PIDCELQKEY 120 

TFIIQAYDCG AG PHETAWKK SHKAWHIQV KDVNEFAPTF KEPAYKAWT EGKIYDSILQ 180 

VEAIDEDCSP QYSQICNYEI VTTDVPFAID RNGNIRNTEK LSYDKQHQYE ILVTAYDCGQ 24 0 

KPAAQDTLVQ VDVKPVCKPG WQDWTKRIEY QPGSGSMPLF PSIHLETCDG AVSSLQIVTE 3 00 

LQTNYIGKGC DRETYSEKSL QKLCGASSGI IDLLPSPSAA TNWTAGLLVD SSEMIFKFDG 360 

RQGAKIPDGI VPKNLTDQFT ITMWMKHGPS PGVRAEKETI LCNSDKTEMN RHHYALYVHN 420 

CRLVFLLRKD FDQADTFRPA EFHWKLDQIC DKEWHYYVIN VEFPWTLYM DGATYEPYLV 4 80 

TNDWPIHPSH IAMQLTVGAC WQGGEVTKPQ FAQFFHGSLA SLTIRPGKME SQKVISCLQA 54 0 

CKEGLDINSL ESLGQGIKYH FNPSQSILVM EGDDIGNINR ALQKVSYINS RQFPTAGVRR 6 00 

LKVSSKVQCF GEDVCISIPE VDAYVMVLQA IEPRITLRGT DHFWRPAAQF ESARGVTLFP 660 

DIKIVSTFAK TEAPGDVKTT DPKSEVLEEM LHNLDFCDIL VIGGDLDPRQ ECLELNHSEL 720 

HQRHLDATNS TAG YS I YGVG SMSRYEQVLH H I RYRNWR PA S LEARRFR I K CSELNGRYTS 780 

NEFNLEVSIL HEDQVSDKEH VNHLIVQPPF LQSVHHPESR SSIQHSSWP SIATWIIIS 84 0 

VCMLVFWAM GVYRVRIAHQ HFIQETEAAK ESEMDWDDSA LTITVNPMEK HEGPGHGEDE 900 
TEGEEEEEAE EEMSSSSGSD DSEEEEEEEG MGRGRHGQNG ARQAQLEWDD STLPY 



BFG7 DNA sequence (SEQ ID NO: 16) 

Gene name: EST; Unigene number: Hs . 91668; Probeset Accession #: Z40805; Nucleic Acid 
Accession #: n/a; Coding sequence: <l-906 (stop codon underlined) 



CGGGTCGACC CACGCGTCCG GGGAGAAAGG ATGGCCGGCC TGGCGGCGCG GTTGGTCCTG 60 

CTAGCTGGGG CAGCGGCGCT GGCGAGCGGC TCCCAGGGCG ACCGTGAGCC GGTGT ACCG C 12 0 

GACTGCGTAC TGCAGTGCGA AGAGCAGAAC TGCTCTGGGG GCGCTCTGAA TCACTTCCGC 180 

TCCCGCCAGC CAATCTACAT GAGT CTAGC A GGCTGGACCT GTCGGGACGA CTGTAAGTAT 24 0 

GAGTGTATGT GGGTCACCGT TGGGCTCTAC CTCCAGGAAG GTCACAAAGT GCCTCAGTTC 3 00 

CATGG CAAGT GGCCCTTCTC CCGGTTCCTG TTCTTTCAAG AGCCGGCATC GGCCGTGGCC 360 

T CGTTTCTC A ATGGCCTGGC CAGCCTGGTG ATGCTCTGCC GCTACCGCAC CTTCGTGC C A 42 0 

GCCTCCTCCC CCATGTACCA CACCTGTGTG GCCTTCGCCT GGGTGTCCCT CAATGCATGG 4 80 

TTCTGGTCCA CAGTYTTCCA CACCAGGGAC ACTGACCTCA CAGAGAAAAT GGACTACTTC 54 0 

TGTGCCTCCA CTGTCATCCT ACACTCAATC TACCTGTGCT GCGTCAGCCT CATCCGCTTC 600 

GACTATGGCT AC AAC CTGGT GGCCAACGTG GCTATTGGCC TGGTCAACGT GGTGTGGTGG 660 

CTGGCCTGGT GCCTGTGGAA CCAGCGGCGG CTGCCTCACG TGCGCAAGTG CGTGGTGGTG 72 0 

GTCTTGCTGC TGCAGGGGCT GTCCCTGCTC GAGCTGCTTG ACTTCCCACC GCTCTTCTGG 78 0 

GTCCTGGATG CCCATGCCAT CTGGCACATC AGCACCATCC CTGTCCACGT CCTCTTTTTC 84 0 

AGCTTTCTGG AAGATGACAG CCTGTACCTG CTGAAGGAAT CAGAGGACAA GTTCAAGCTG 900 
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GAC TGAA GAC CTTGGAGCGA GTCTGCCCCA GTGGGGATCC TGCCCCCGCC CTGCTGGCCT 960 

CCCTTCTCCC CTCAACCCTT GAGATGATTT TCTCTTTTCA ACTTCTTGAA CTTGGACATG 1020 

AAGGATGTGG GCCCAGAATC ATGTGGC C AG CCCACCCCCT GTTGGCCCTC ACCAGCCTTG 1080 

GAGTCTGTTC TAGGGAAGGC CTCCCAGCAT CTGGGACTCG AGAGTGGGCA GCCCCTCTAC 1140 

CTCCTGGAGC TGAACTGGGG TGGAACTGAG TGTGCTCTTA GCTCTACCGG GAGGACAGCT 1200 

GCCTGTTTCC TCCCCATCAG CCTCCTCCCC ACATCCCCAG CTGCCTGGCT GGGTCCTGAA 1260 

GCCCTCTGTC TACCTGGGAG ACCAGGGACC ACAGGCCTTA GGGATACAGG GGGTCCCCTT 132 0 

CTGTTACCAC CCCCCACCCT CCTCCAGGAC ACCACTAGGT GGTGCTGGAT GCTTGTTCTT 1380 

TGGCCAGCCA AGGTTCACGG CGATTCTCCC CATGGGATCT TGAGGGACCA AGCTGCTGGG 144 0 

ATTGGGAAGG AGTTTCACCC TGACCRTTGC CCTAGCCAGG TTCCCAGGAG GCCTCACCAT 1500 

ACTCCCTTTC AGGGCCAGGG CTCCAGCAAG CCCAGGGCAA GGATCCTGTG CTGCTGTCTG 1560 

GTTGAGAGCC TGCCACCGTG TGTCGGGAGT GTGGGCCAGG CTGAGTGCAT AGGTGACAGG 1620 

GCCGTGAGCA TGGGCCTGGG TGTGTGTGAG CTCAGGCACT AGGTGCGCAG TGTGGAGACG 1680 

GGTGTTGTCG GGGAAGAGGT GTGGCTTCAA AGTGTTGTGT GTGCAGGGGG TKGGTGTGTT 174 0 

AAGCGTGGGT TAGGGGAACG TGTGTGCGCG TGCTGGTGGG CATGTGAGAT GAGTGACTGC 1800 

CGGTGAATGT GTCCACAGTT GAGAGGTTGG AGCAGGATGA GGGAATCCTG TCACCATCAA 1860 

TAATCACTTG TGGAGCGCCA CTTGGCCCAA GACGCCACCT GGGCGGACAG CAGGAGCTCT 1920 

CCATGGCCAG GCTGCCTGTG TGCATGTTCC CTGTCTGGTG CCCCTTTGCC CGCCTCCTGC 1980 

AAACCTCACA GGGTCCCCAC ACAACAGTGC CCTCCAGAAG CAGCCCCTCG GAGGCAGAGG 204 0 

AAGGAAAATG GGGATGGCTG GGGCTCTCTC CATCCTCCTT TTCTCCTTGC CTTCGCATGG 2100 

CTGGCCTTCC CCTCCAAAAC CTCCATTCCC CTGCTGCCAG CCCCTTTGCC ATAGCCTGAT 216 0 

TTTGGGGAGG AGGAAGGGGC GATTTGAGGG AGAAGGGGAG AAAGCTTATG GCTGGGT CTG 222 0 

GTTTCTTCCC TTCCCAGAGG GTCTTACTGT TCCAGGGTGG CCCCAGGGCA GGCAGGGGCC 2280 

ACACTATGCC TGCGCCCTGG TAAAGGTGAC CCCTGCCATT TACCAGCAGC CCTGGCATGT 234 0 

TCCTGCCCCA CAGGAATAGA ATGGAGGGAG CTC CAGAAAC TTTCCATCCC AAAGGCAGTC 24 00 

TCCGTGGTTG AAGCAGACTG GATTTTTGCT CTGCCCCTGA CCCCTTGTCC CTCTTTGAGG 2460 

GAGGGGAGCT ATGCTAGGAC TCCAACCTCA GGGACTCGGG TGGCCTGCGC TAGCTTCTTT 2520 

TGATACTGAA AACTTTTAAG GTGGGAGGGT GGCAAGGGAT GTGCTTAATA AATCAATTCC * 2580 
AAGCCT CAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 



BFG7 Protein sequence (SEQ ID NO: 17) 

Gene name: EST; Unigene number: Hs . 91668; Probeset Accession #: Z40805; Protein Accession 
#: n/a; Signal sequence: none; Predicted TM domains: 117-133, 179-195, 211-227, 235-251, 
266-282, 296-312; PFAM domains: none; Summary: A type III membrane protein of unknown 
function; is adjacent to HER2 on the genome, and its overexpression in breast cancer is highly 
correlated with HER2 expression; may be used to predict HER2 overexpression and amplification. 



RVDPRVRGER MAGLAARLVL LAGAAALASG SQGDREPVYR DCVLQCEEQN CSGGALNHFR 60 

SRQPIYMSLA GWTCRDDCKY ECMWVTVGLY LQEGHKVPQF HGKWPFSRFL FFQEPASAVA 120 

SFLNGLASLV MLCRYRTFVP ASSPMYHTCV AFAWVS LNAW FWSTVFHTRD TDLTEKMDYF 180 

CASTVILHSI YLCCVRTVGL QHPAWSAFR ALLLIiMLTVH VSYLSLIRFD YGYNLiVANVA 24 0 

I6LVNWWWL AWCLWNQRRL PHVRKCVWV LLLQGLSLLE LLDFPPLFWV LDAHAIWHIS 300 
TIPVHVLFFS FLEDDS LYLL KESEDKFKLJD 



BCN4 DNA sequence (SEQ ID NO: 18) 

Gene name: ESTs; Unigene number: Hs. 283713; Probeset Accession #: F13673; Nucleic Acid 
Accession #: n/a; Coding sequence: 143-874 (start and stop codons underlined) 



GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60 

CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 12 0 

CGCTGCCCGG CAGCCGGGAG CC ATG CGACC CCAGGGCCCC GCCGCCTCCC CGC AG CGGCT 180 

CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 24 0 

CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 300 

AATGTGCTTA CAAGGG CCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 

CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 

TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 

ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 54 0 

AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600 

CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 

AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT T C AACAATT A ATATTCATCG 720 

CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780 

CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 84 0 

TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAA TGCTT TAATTTTCAT TTGCTACCTC 900 

TTTTTTTATT ATGC CTTGG A ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960 

CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 102 0 

TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080 

TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 114 0 

GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTT AAAAAA ATATAAAAGC TACCAATCTT 1200 

TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260 
CAACCTTAAA AAAAAAAAAA AAAA 



BCN4 Protein sequence (SEQ ID NO: 19) 

Gene name: ESTs; Unigene number: Hs. 2 83713; Probeset Accession #: F13 673; Protein Accession 
#: n/a; Predicted Signal sequence: 1-30; TM domains: none; PFAM domains: none; Summary: a 
secreted protein; has a mouse orthologue (see sequence below) . 



102 



MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREWDL YNGMCLQGPA 60 

GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLR2SFEE SWTPNYKQCS WSSLNYGIDL 120 

GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 180 

GSPEMNSTIN IKRTSSVE3L C EG I GAG LVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 240 
LPK 



Mouse BCN4 Protein sequence (SEQ ID NO: 20) 
Gene name: EST3; Unigene number: Mm. 41556 

XXXXAAPPQL LLGLFLVLLL LLQLSAPSSA SENPKVKQKA LIRQREWDL YNGMCLQGPA 60 
GVPGRDGSPG ANGIPGTPGI PCQDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120 
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPP PIEAIXXXXX 180 
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXSD Y P KGD AYTGW DSVSRIIIEE 24 0 
LPK 
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